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Description 

This invention relates to a method for compressing binary image information, in particular in cases where 
the image comprises text, i.e. only black or white areas. This invention is, therefore, applicable in the electronic 

5 processing of information presented on printed or hand written documents, where the storage of the information 
while the various processing steps are being performed, the storage for future reference or retrieval and the 
treansmission over data links present space and/or time problems. 

The handling of paper documents is a daily routine in today's office environments. Considerations having 
to do with the care for the natural resources required for manufacturing the paper used for the documents, the 

10 speed of preparing and handling the documents to make them serve their purpose, and their storage and pos- 
sible later retrieval, have resulted in efforts to reduce the number of documents circulated and to leave the 
working with the documents to automatic record handling devices. The physical handling of the documents is 
one important aspect in this connection, the other, possibly more important aspect is the treatment of the in- 
formation contained in the documents. 

15 The treatment of the information contained in documents generally involves the acquisition of the infor- 
mation by some reading device, the transformation of the acquired information into a machine-readable code, 
the storing of the coded information for later, and possibly repeated processing, the actual processing of the 
information, and finally the output of the results of the processing, or simply of the unprocessed original infor- 
mation. This output may take visual form, as on a display unit, or in print, or be purely electronic. 

20 The acquisition of the information by a reading device, such as an optical scanner, should be performed 
with a reasonably high resolution lest the information content should become mutilated or partially lost. Ac- 
cordingly, the reading device will produce a high volume of scan data which in turn require a large memory 
capacity for storage. As a typical example, a page of the A4 size (297 x 210 mm), scanned at 100 pels/cm 
(where "pel" stands for picture element and is either a white or black dot), requires about 700 kBytes of storage 

25 space. Even a rather modest number of documents, say a couple of thousands, would call for an unreasonably 
big memory. 

To alleviate this problem, known document scanning systems are provided with data compression routines 
which save about one order of magnitude so that the compressed data of a scanned A4 page can be accom- 
modated on 25 to 75 kBytes of storage space, depending, of course, on the content of the scanned page. Very 
30 clever algorithms, based on arithmetic coding, can achieve a further reduction of about 1 6%. See, for example, 
R. Arps, T. Friedman, G. Langdon and T. Truong, " Adaptive Binary Image Compression: Chip Status and Ap- 
plications", Image ITL, May 1986. 

It goes without saying that any compression system must permit later decompression of the information, 
be it for processing or output Whether the compressing or decompressing can be done with a certain loss of 
35 information, or degradation of quality, depends on the purpose for which the output is needed, and on the 
amount of redundancy contained in the original image. 

In a number of applications where very large volumes of documents must be handled, such as, e.g., in 
connection with a census, it is essential to increase the compression ratio to its limits, at least by another order 
of magnitude. 

AO One possible approach to this problem is described in "Combined Symbol Matching Facsimile Data Com- 
pression System" by W.K. Pratt, P.J. Capitant, W. Chen, E.R. Hamilton and R.H. Wallis, in Proc. IEEE, Vol. 68, 
No.7, July 1980. 

There it was proposed to feed binary images into a character matching process. Recognized characters 
will be represented very efficiently by means of their alphanumeric form. Then the remaining residue informa- 
45 tion will be compressed separately as a conventional binary image. This amounts to an about two-fold increase 
in compression ratio, although the efficiency of this scheme largely depends on the percentage of recognized 
characters and on the degree of digitization noise. 

An improved technique is disclosed in US-A-4,499,499 Brickman et ah, where instead of single character 
recognition the matching of large symbols, such as individual words in the text, is considered. This technique 
50 requires, however, the storing of an addressable library of facsimile symbols, or words, and it does not work 
for words not found in that library. 

In the ICL Technical Journal 5 of May 1986, No. 1 , Oxford(UK), pp. 123-146 an overview covering several 
coding and data compression methods is given. It is proposed to classify and segment a document into regions 
containing exclusively either text or graphics. It is further proposed to compress these segments according to 
55 different compression methods. However, no criteria are mentioned which allow to determine the degree of 
compressibility of a segment 

From the patent US-A-4,229,769 to Kurahayashi, a process is known in which the scanned image of a 
document is selectively thinned and thereby compressed. The patent also discloses a process to eliminate iso- 
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lated dots prior to the thinning step. 

It is an object of the present invention to propose a method for the compression of image data with a com- 
pression ratio considerably larger than offered by conventional compression methods. The inventive method 
is based on the cognition that this cannot be achieved if one was to insist on lossless compression, i.e. com- 
5 pression that would permit an exact reconstruction of the original information. Actually, in the majority of prac- 
tical applications involving text images, lossless compression is not essential. It suffices to have a legible text 
which subjectively is indistinguishable from the original. 

A trivia! approach to this would be to reduce the resolution of the scanning. Indeed, many conventional 
scanners and facsimile devices do allow this option. However, while this approach might be acceptable for 
10 some types of image, such as large handwritten text, it would mean unacceptable distortion for other types, 
like those containing small print. Even for medium-sized text, simple reduction of resolution would cause image 
quality deterioration which might be annoying to the user. 

In contrast, the present invention pretends to solve this problem through a method featuring an overall 
high compression ratio by means of "lossy" encoding where this is tolerable, and by means of "lossless" en- 
15 coding where this is essential in maintaining the reproduction quality. This proposal, accordingly, involves the 
use of variable compression ratios which depend on an evaluation of the nature of the binary image at hand, 
with lossy compression being limited to large low-frequency areas (where quality deterioration is subjectively 
unnoticeable, and with small high-frequency areas being compressed losslessly. 

The method of the invention still even go one step further Even in areas where lossy compression is found 
20 feasible, it will not be applied indiscriminately. Instead, the compression ratio to be used will in each particular 
case be adapted to the preservation of the continuity and smoothness of the lines in order to guarantee an 
overall excellent image quality. 

The method of the invention is claimed in claim 1. 

Details of the invention will now be described by way of example with reference to the drawing in which: 
25 Fig. 1 is an original, completed form with a colored background; 
Fig. 2 is the form of Fig. 1 after filtering; 

Fig. 3 is a flow chart showing the determination of the compressibility; 
Fig. 4 is a reconstruction of a conventionally compressed handwritten text; 
Fig. 5 is a reconstruction of the text of Fig. 4 compressed in accordance with the invention. 
30 The method in accordance with the invention essentially comprises the following five discrete steps: Pre- 
filtering, Evaluation of Compressibility, Connectivity-oriented Subsampling, Lossless Compression and De- 
compression, and Reconstruction. Each of these steps will hereinafter be reviewed in detail. 

1. Prefiltering 

35 

Prior to the actual evaluation for compressibility, the scanning data must be "cleaned" by removing all small 
superfluous black patches which might result either from the scanning noise or from a non-uniformity of the 
document background. There is a wide variety of filters available which may be used. Reference is made to 
T. Kimoto and R.G. Casey, "Edge Smoothing Algorithms for Horizontal and Vertical Line Patterns in Binary 

40 Images", IBM Research Report RJ 4156, November 1984, and F.M.Wahl, "A Binary Image Processor for Docu- 
ment Quality Improvement and Data Reduction" IBM Research Report RZ 1438, December 1 985. 

The optimum choice of the filter depends on the character of the specific application and on the availability 
of the computational power. One preferred filter which is both, fast and effective for a great variety of practical 
applications, operates on the principle of removing from the binary image all of the black pixels f pixel" stands 

45 for 'picture element* ), except for those which either belong to a large black segment or which are adjacent to 
such segment. A large segment in this context is defined as a run of three black pixels in a row, either horizontal 
or vertical. If the following information pattern (Array 1) 



w ...11111111111111 1 

1 111 11 1 1 

111 1 1 1 

1.1111 1 

55 



is passed through such a filter, the result will be as follows (Array 2): 
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• ..11111111111111 1 

111 1 

111 1 

1111 1 



10 Clearly, as a result of the filtering step, small features, which human vision tends to interpret as noise, will 
disappear. At the same time, written characters, even if they are only one pixel wide, will remain unaltered. 

As an example for the efficiency of this approach, reference is made to Figs. 1 and 2. Fig. 1 is a copy of 
an original form having a colored background created by a diffused grid of colored dots. After scanning, the 
data are overloaded with a great deal of noise stemming from the colored background. This noise is, of course, 

15 subjectively annoying, but it also consumes a lot of valuable storage space, and in addition prevents efficient 
compression. Having passed the scanning data through a filter of the characteristics mentioned above, the 
background noise has disappeared from the data, as shown by Fig. 2 which is a reconstruction generated from 
the filtered data. 

While the filter described above was assumed to be designed for the removal only of small black patches 
20 on a white background, a similar filter can be constructed for the removal of small white patches on a black 
background, such as white noise on a normally black line element Preferably, both types of filter might be 
chained up to be able to deal with black and white noise areas. 

2. Evaluation of the Degree of Compressibility 

25 

Of course, in order to save processing time and storage space, one would prefer to apply lossy compres- 
sion to the entirety of the data representing a scanned binary text image. However, any binary image comprises 
areas with a high information density, and other areas where the information content is low. Lossy compression, 
therefore, when applied to high-density information, will inevitably lead to a degradation of the intelligibility of 

30 such information. The method of the invention, therefore, pretends to apply the lossy compression only to those 
parts of the information where the intelligibility is not in danger of being noticeably degraded. Thus, lossy com- 
pression should be applied only in the low-frequency areas where characters are drawn by relatively wide lines, 
and with large distances between words. 

To do this, the image will be partitioned into mutually exclusive segments, and an individual compression 

35 ratio will be assigned to each of the segments. The partitioning may be made in an arbitrary manner governed, 
of course, by the desire to have the smallest segments for which a statistical compression scheme is still mean- 
ingful. 

This task is similar to the differentiation between image and text segments of a data array, for which a 
proposal was made by K.Y. Wong, R.G. Casey and F.M. Wahl, "Document Analysis System", IBM Journal of 
40 Research & Development, Vol. 11, November 1982, pp. 647-656. However, the specific nature of the problem 
tackled with the present invention requires a solution quite different from the one proposed in this reference. 

It will be clear to those skilled in the art that the feasibility of performing lossy compression depends oh 
the widths of white and black intervals in the original text image. It is, therefore, necessary to have a quantitative 
measure of the relative width of these intervals, with an absolute width value governing the threshold beyond 
45 which the discrimination commences. Two alternative approaches may be considered: 

a. The Pixel-Based Approach 

The idea is to scan the image line by line and to determine, for each block, the average length for all, black 
50 and white intervals. For the sake of simplicity, in the following discussion it will be assumed that always two 
consecutive lines constitute one block. Naturally, the existence of large black or white areas, such as wide 
page margins, should have no impact on the determination of the compressibility. 

The partitioning procedure will now be explained with reference to the flowchart of Fig. 3. In its f irst step, 
the procedure ensures that the registers designated as m and n are cleared. These registers may, for example, 
55 simply be two places in the main memory that accommodate two integer numbers. It is then determined wheth- 
er the end of the segment under evaluation has already been reached. If so, a branch is made to a check of 
the value for n. If it exceeds a predetermined threshold r\, say 5, then a new segment will be addressed. If it 
does not exceed that threshold, a new run of black (1) or white (0) is taken into consideration to determine its 
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uninterrupted length w. 

A check is then made as to whether the length w exceeds the threshold w t . In one embodiment of the in- 
vention, a threshold value of w t = 12 was found to be appropriate. If that threshold is exceeded, new run is 
addressed. If not, the value of n is incremented by 1, the value of m is increased by w. This process continues 
5 until the end of the segment is encountered. If n is still smaller than the threshold rit, lossy compression is per- 
formed on that segment If n is larger than n t , lossy compression is performed only if 



which may be set to 4, for example. 

It is to be noted that the choice of the design parameters, such as the threshold values referred to above, 
is a matter of experimentation. The values given herein were determined empirically, but they are not meant 
15 to be limiting in any way. The values may be varied as necessary in order to increase or decrease the share 
of the compressed areas, and to, thus, choose the trade off between image quality and desired degree of com- 
pression. 

b. The Byte-Based Approach 



While the pixel-based approach outlined under 1) above performs quite well, it might under certain cir- 
cumstances turn out to be computationally cumbersome. This can be improved by packing, or re-packing, the 
pixels into a larger "containment". Indeed, in the majority of practical applications, image data are packed in 
the byte mode, where one byte represents eight consecutive pixels. Hence, considerable time will be required 

25 for the translation of the data to the "normal" raster mode in order to compute the length of all the black and 
white intervals. In this context, "normal" means that a single number is assigned to each pixel. Thus, if one 
has to perform arithmetic operations on pixel values, one has to first translate each byte into the corresponding 
8-pixel values. This could, e.g. be done by using a look-up table with 256 entries, one for each possible byte 
value, and eight output ports, one for each pixel. 

30 The preferred procedure comprises the following four steps: 

1) The number m of non-zero bytes is first computed. 

2) Each byte is translated into a binary number. This binary number equals 1 if the given byte contains a 
small black or white run, and it equals 0 in all other cases. A byte is determined to actually contain such a 
small black or white run if it does not equal either 00000000 or 11111111, when, at the same time, the first 

35 pixel of the byte equals the last one. 

3) The number n of pairs of consecutive "1"-s is computed. 

4) Lossy compression is initiated if n is smaller than a certain predetermined threshold (which may be 3, 
for example), or if n is smaller than a given fraction of m (say less than 5%). The mentioned threshold and 
fraction depend on the specific application and should be determined experimentally. 

40 While the horizontal and vertical distances are generally of the same order of magnitude, it might be ad- 
vantageous in certain cases to apply the test mentioned before to the transposed version of the original image: 
Then vertical distances will be considered. Moreover, it might be useful to develop the compressibility criterion 
from a combination of the regular and transposed versions of the original image, although the experiments 
carried out suggest that in most practical cases the consideration of the horizontal distances, as described 

45 above, is sufficient 

It should be further noted that it might be useful to apply the test to two consecutive lines simultaneously, 
where the length of the short run would be defined as the larger of two consecutive intervals. As a result, the 
impact of one-pixel wide noise will be mitigated. This refers in particular to cases where a pixel is located at a 
line edge. Then either a 1 or 0 value can be obtained. Hence, at the edges of straight lines one can see a one- 
50 pixel layer of scanning noise. This is, of course, not the only possible noise effect, but it very important from 
the point of view of the compressibility test 

Consider, for example, the following long horizontal line (Array 3): 



10 




20 
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00000000000000000000000000000000000 
00000000000000000000000000000000000 
001 10000011 1 1 1000001 1 1 1 000001010101 

11111111111111111111111111111111111 
11111111111111111111111111111111111 

01000011000101010101000011111111100 

w ooooooooooooooooooooooooooooooooooo 

Obviously, it can be subjected to a lossy compression without any discernible image quality degradation, 
however, on the edge (third and sixth lines from the top) a lot of short black and white sequences might cause 
the decision that this area is incompressible. 

15 

3. Connectivity-Oriented Subsampling 

In the connectivity-oriented subsampling of the original image, the aim is to reduce the resolution by some 
factor in both horizontal and vertical directions. Assuming the factor is chosen to be 2, the image will have to 

20 be partitioned into 2 x 2 blocks, and a one-pixel representation for each block will have to be found. 

Blocks having less than two black pixels will be regarded as white. Their average representative value will 
be 0. Blocks having more than two black pixels will be considered as black, and their average representative 
value will be 1 . The question is what would be the most appropriate decision for blocks having exactly two black 
pixels? One obvious approach would be to use some arbitrary decision mode. For example, "problematic* 

25 blocks will be assigned either always 0 or always 1 . Unfortunately, if such an approach were to be adopted, 
it would either cause the disappearance of all narrow (one pixel wide) lines or would merge some originally 
disjoint lines. In any case, the deterioration in image quality would be unacceptable. 

A preferred approach is based on the consideration of the vicinity of each "problematic" block. Under this 
scheme, a value 0 is assigned to each block having two black pixels, except for those cases where such an 

30 assignment would break a connection between two lines. In this last case, the assignment would be switched 
to 1 . One possible embodiment of such an assignment scheme would be the following iterative procedure (still 
assuming that the factor of reduction is 2): 

a. The original binary image array O is partitioned into mutually exclusive 2x2 blocks. 

b. The final array O r (i.e. the one with reduced resolution) is created by setting to 1 all pixels which corre- 
35 spond to the blocks of O and where the number of black pixels exceeds 2. 

c. Traverse the original array O on a block-by-block basis, for each block having exactly two black pixels. 
Denote by A through H all neighboring pixels (in array O r ) as depicted below, with the pixel under consid- 
eration being designated "X" (Array 4): 



D X E 
F G H 

d. If B=D=E=G=1, then X constitutes a narrow opening on the black background. Accordingly, X is set to 
O, and the consideration is shifted to the next pixel of O (or to the corresponding Block of O). 

e. If all of the pixels A through H in the vicinity of X are white (O), then we are dealing with a narrow black 
50 line. X is then set to 1 , and the next pixel is taken into consideration. 

f. If D=E=0, and in O there is a connection from B to G, then X is set to 1, and the next test below is per- 
formed. 

The existence of the connection between B and G will be established by taking, from O, a 6 x 6 array 
T which consists of the pixels belonging to the block under consideration and its eight nearest neighbors, 
55 i.e. all the pixels belonging to the blocks in the scheme above. Per definition, the connection exists if each 

line of T has at least one black pixel. 

g. If B=G=0, and in O there is a connection between D and E, then X is set to 1, and the scheme proceeds 
to the next test below. 

6 



EP 0 411 232 B1 



The existence of the connection between D and E will be established by considering the array T which 
was defined in the previous stage. The connection exists if each column of T has at least one black pixel, 
h. At this stage of the process there remain only three possibilities: Between the pixels B t D, E, G there 
are one, two or three black pixels. 
5 For each of these cases, the decision logic will be as follows: 

1) Between B, D, E, and G there is exactly one black pixel. 

This case is applicable mainly to the pixels located on the edges of of relatively thick lines. Accordingly, 
an arbitrary decision mode may be adopted. However, in order to preserve the line width, X was chosen 
as X = 1 for lines coming from the top and left directions, and X = 0 for lines coming from the right and 
10 from the bottom. Accordingly, 

a) If B = 1, and D=E=G= 0, then set X to 1 

b) tf D = 1, and B=E=0,then set X to 1. 

c) Else, set X to 0 and proceed to the next pixel. 

2) Between B, D, E and G there are exactly two black pixels. 

15 a) If the black pixels are located opposite one another (B=G=1, or D=E=1), then set X to 1. 

b) If the black pixels are adjacent to each other, i.e., if B=D=1, for example, the X should be set to 1 
provided the number of black pixels in the original image within the block corresponding to B (or D) 
does not exceed 2. 

c) Else, set X to 0. 

20 3) Between B, D, E and G there are exactly three black pixels. 

In this case, A should be set to 1 unless it is located in the vicinity of a curved line (say in the form of the 
character "c") so that improved image quality will be obtained by assigning X=0. 

The decision regarding the curvature of the line in the vicinity of X can be reached with consideration being 
given to the 4 x 3 array Ti defined as follows (Array 5): 

25 



A 


B 


C 


F 


G 


H 


A 


0 


F 


C 


E 


H 



It is customary to say that the curvature is high (i.e. X = 0) if in each line of T, there is at least one black 
pixel. 

35 

4. Lossless Compression and Decompression of the Subsampled Image 

The reduced original file O r will be compressed and stored utilizing any of the conventional techniques for 
lossless compression of binary files, like MMR or Arithmetic Coder. Naturally, since the file O r contains only 
40 a quarter of the number of pixels as compared to the original file O, it can be coded more efficiently. It has 
been found experimentally that the subsampled file O r can be represented by a code two times shorter than 
that necessary for encoding the original file O. 

5. Reconstruction of the Original Image and Postf iltering 

45 

The reconstruction of the original file will be performed by reversing the stages explained above. First, 
by lossless decompression, subsampled file O r will be retrieved. Then an expanded version 0 will be obtained 
such that each pixel of O r will be replaced by the 2 x 2 block of pixels having equal value. 

Since the subsampling procedure of the invention is designed in a way to preserve the connectivity of the 
so original file, the reconstructed file 0 will for practical purposes be as legible as the original O. However, since 
the subsampling causes a increase in effective pixel size, certain lines might have a somewhat "rugged ap- 
pearance". 

Consider, for example, the following short section of the subsampled image O r (Array 6): 
55 . 1 

1 . 
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Such a configuration might have resulted from a diagonal line in the original image O, as depicted below 
(Array 7): 



10 



15 



8): 



After subsampling and subsequent expansion of the image above, the array shown below wOl result (Array 



20 


• • • 




25 


• • • 

• • • 

. . 1 


1 . . . . 




. . i 


i . . . . 




1 » • 
• I • 


• • • • • 

■ • • a • 



30 



35 



Clearly, the original line is smooth while the reconstructed line is flawed by a noticeable "staircase effect 0 
One obvious approach to solve this problem would be to expand segments of the type (Array 9): 

. 1 



into a filtered array like the one following (Array 10): 



40 



45 



..11 
.111 
111. 
11.. 



While this technique is adequate for the example shown above, it might cause quality deterioration in cer- 
tain areas of the image, particularly at right-angle intersections of two lines, as shown in the next array (Array 
11): 



50 



55 
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In this case, an indiscriminate transformation of the before-last Array 6 of O r into Array 10 in 0 will distort 
sharp angles. Therefore, the simple solution outlined above will be unacceptable in practical applications. An 
15 alternative approach in accordance with the invention uses post-filtering of the expanded image 6 in order 
that diagonal lines (like the one of Array 7) will look smooth, and at the same time sharp angles will remain 
unaltered so as to maintain the subjective impression of a crisp image. This approach is computationally simple 
enough for the majority of the pixels so as to permit its execution on PC-based systems, with more sophisti- 
cated test being required only for relatively few "problematic" areas. The filtering logic proposed in this con- 
20 nection is the following: 

a. Traverse the image O n making no changes unless a "problematic" area (Array 12) is located: 

1 A A 1 

25 or 

B 1 1 B 

wherein A or B, or both equal 0. 
30 b. Consider the array shown below (Array 13). The discussion will be limited to a single array of this type 
although, in practice, one must consider all of the possible orientations this array can have. 

A B C D 

35 E F . 1 

G H 1 I 

1) If C=1 and B=F=H=0, then we are dealing with a small curve, and Array 13 should be transformed 
40 to the following Array 14: 



45 
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, 1 


1 


I 


I 



50 

2) If C=D=E=F=0, then we are dealing with a diagonal line and the array should be transformed to the 
following (Array 15): 



55 
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A A 

A A 

11 

5 . • . J H 1 1 1 

GGHH1 1 II 
G G H H 1 1 II 

w wherein J = 1 if both G and H equal 1 . 

3) Else, we are dealing with the right angle between the two tines and no further action need to be 
taken, i.e. one proceeds to the next pixel. 

Possible Substitution of the Five Stages of the Method 

15 

The description so far dealt with the preferred embodiments of the five stages of the inventive method. 
However, each stage may be modified in a certain way in order to better match the specific needs of any par- 
ticular application. Some of these possible modifications will be described hereafter. 

1. The prefiltering stage as described herein may be changed to another technique capable of reducing 
20 the scanning noise, like the one taught by T. Kimbto and R.G. Casey, "Edge Smoothing Algorithms for Hor- 
izontal and Vertical Line Patterns in Binary Images" IBM Research Report RJ 4156 (45944) November 
1984, or F.M. Wahl, "A Binary Image Processor for Document Quality Improvement and Data Reduction", 
IBM Research Report RZ 1438 (52141) December 1985. In certain cases, e.g., where the noise is very 
low, or where there is little time and one is willing to trade performance quality for enhanced running time, 

25 it might be advisable to discard this stage altogether. 

2. Regarding the fast evaluation of the degree of compressibility, two alternative approaches have been 
described above: one pixel-based, the other byte-based. In both approaches, one can, of course, modify 
the various decision thresholds. Moreover, it is conceivable that in applications where the compressibility 
can be taken as granted, like compression of handwritten texts, this stage is entirely superfluous. 

30 3. As far as the subsampling stage is concerned, it is important to perform this operation in order to guar- 
antee that connectivity be preserved and that no disjoint features will be merged unintentionally. The dis- 
cussion above of this step was based on the specific example of a subsampling factor of 2, both in x- and 
y-directions. While this subsampling factor is believed to be the most preferable, the factor may as well 
be chosen to be *2. In cases where a very high compression ration is desired, the subsampling operation 

35 may be performed in an iterative manner, with the effective subsampling factor being 2, 4, 8, etc. 

4. To perform lossless compression, a variety of conventional coding schemes are available, such as Huff- 
man, MMR or Arithmetic Codec. 

5. An alternative to the post filtering described above would be the use of the filter described in the Kimoto 
et at. reference mentioned before in connection with the prefiltering step. Again, in certain cases one might 

40 be willing to accept some degree of image quality degradation and, therefore, skip the postf iltering pro- 
cedure. 

The hardware implementation of the various operations involved in the inventive method does not require 
an inventive step: Those skilled in the art will immediately realize that there are various conventional ways to 
implement the design logic required. One preferred way is the use of look-up tables (LUT). For example, in 
45 the subsampling operation, there are three cases to be considered. The differentiation between them is based 
on the values of certain four pixels (here denoted by B, D, E and G). It will be easy for a skilled person to con- 
struct a LUT with 16 (2*) rows and three columns, which would provide a fast way to determine which case is 
valid at any given time. 

The inventive method is advantageous over the existing lossless compression techniques in the sense that 
so it provides a considerable increase in compression ratio. An improvement by a factor of 2 is by no means ex- 
aggerated. Of course, this improvement is achieved at the expense of some deterioration of image quality. 
However, owing to the associative approach of the method of the invention, i.e., the compression scheme used 
on each particular image element being adapted to the characteristics thereof, the reconstructed binary texts 
are subjectively almost indistinguishable from the original from which they were derived. 
55 As an example, consider the handwritten text of Fig. 4. Uncompressed, the text requires 80/080 bytes of 
storage space. After conventional lossless compression, the same text can be represented by only 2217 bytes. 
Fig. 4 shows the text so compressed. With the compression scheme of the inventive method, the same text 
can be compressed to mere 718 bytes. The image reconstructed from those 718 bytes is shown in Fig. 5. Al- 
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though a more than three-fold higher compression was performed as compared to the conventional lossless 
compression, the legibility of both, the conventionally compressed image of Fig. 4 and of the reconstructed 
image of Fig. 5 compressed in accordance with this invention, is virtually the same. 

It is pointed out that owing to the specific nature of the inventive method, its implementation is very simple 
5 from a hardware point of view. Also, most present-day personal computers provide sufficient computing power 
to accommodate this method. 



Claims 

10 

1. Method for high-quality compression of binary text images for storage and possible future processing in- 
cluding transmission over data links, involving the scanning thereof on the record carrier on which they 
are presented in a pixel-by-pixel raster scan and deriving a string of digital data from the scanner output, 
and partitioning said image into segments, characterized by the steps of determining the degree of com- 
15 pressibility of a segment by taking into consideration the number of unintenruped runs of '1' or '0' within 

said segment and assigning according to said degree of compressibility one of at least two different com- 
pression ratios, one of which being lossy, to the data of said segment and compressing said data in ac- 
cordance with said one compression ratio. 

20 2. Method in accordance with claim 1 , characterized in that the average length of said number of uninter- 
rupted runs of "1" or "O" in one segment is determined to determine the degree of compressibility of said 
one segment 

3. Method in accordance with any of claim 1 or 2, characterized in that at the start of each segment, two 
25 registers for storing a first and a second number (m, n) are cleared, that all lengths (w) of uninterrupted 

runs of "0" or "1" are determined, that if a length (w) is larger than a predetermined length value (wj, it is 
ignored, otherwise, the first number (n) is incremented by 1, and the second number (m) by w, and that 
at the end of the segment the feasibility of compression is determined by checking whether n is lower 
than a predetermined first threshold (n^, or whether the ratio m/n exceeds a predetermined second thresh- 
^ old ([m/nl), in which cases lossy compression is performed while in all other cases lossless compression 

is performed. 

4. Method in accordance with claim 3, characterized in that the said predetermined length value (wj is 12, 
that said first threshold (nj is 5, and that said second threshold ([m/nJO is 4. 

35 5. Method in accordance with any of claim 1 or 2, characterized in that for each segment the number (m) of 
nonzero bytes containing at least one "1" respectively is counted and each such byte is translated into a 
binary number which - equals 1 if the byte under consideration contains a small black or white interval, 
that is, that byte does not equal either 00000000 or 11111111 and the first bit of the byte equals the last 
one, and which - equals 0 in all other cases, that the number (n) of pairs of consecutive "1"-s is counted, 

40 and that lossy compression is performed if n is smaller than a predetermined threshold and/or if n is small- 

er than a predetermined fraction of m. 

6. Method in accordance with claim 5, characterized in that the threshold is 3, and that fraction equals 5 %. 

45 7. Method in accordance with any the preceding claims, characterized in that said step of determining the 
degree of compressibility of the individual features of the original image is preceded by a prefiltering step 
involving the filtering of the scan data prior to compression through forced changing of all isolated "1" pixels 
as well as all horizontal and vertical sequences of "1" pixels to '0* if the said sequences are shorter than 
a predetermined length. 

so 

8. Method in accordance with claim 7, characterized in that the said predetermined length is set to three pix- 
els. 

9. Method in accordance with any of the preceding claims, characterized in that the segments are defined 
55 by partitioning the entire image area into mutually exclusive segments of uniform size. 

10. Method in accordance with claim 9, characterized in that the mutually exclusive segments each corre- 
spond to a line or to two consecutive lines of scan of the original binary text image. 
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11. Method in accordance with any of the preceding claims, characterized in that the lossy compression in- 
volves steps of defining 2x2 blocks of pixels, and developing for each such block a kbit representation 
by assigning a "0"-value to all blocks having less than two black pixels, and a "1 "-value to all blocks having 
more than two black pixels, and by assigning a '0"-value to all blocks having exactly two black pixels, 
5 provided such an assignment does not break a connection between two lines and does not distort line 

curvature. 



Patentanspruche 

10 

1 . Verfahren zur qualitativ hochwertigen Komprimierung binarer Textbilder zur Speicherung und moglichen 
zukOnftigen Verarbeitung einschlieOlich der Ubertragung uber Datenverbindungswege, bestehend aus 
der Abtastung der Datent rager, auf dem sie in einer Pixel-Pixei-Rasterabtastung dargestellt sind, und der 
Ableitung einer Kette digitaler Daten von der Scannerausgabe, charakterisiert durch die Schritte der Be- 
ts stimmung des Grads der Komprimierbarkeit eines Segments durch Berucksichtigung der Anzahl unun- 

terbrochener Aneinanderreihungen von '1' und '0' innerhalb des genannten Segments, der Zuordnung ei- 
nes von mindestens zwei unterschiedlichen Komprimierungsver haltnissen entsprechend des genannten 
Grads der Komprimierbarkeit, wobei eines der Komprimierungsverfahren mit Vertusten von Daten des 
genannten Segments einhergeht, und der Komprimierung der genannten Daten in Obereinstimmung mit 
20 dem genannten einen Komprimierungsverhaltnis. 

2. Verfahren gem§S Anspruch 1, dadurch charakterisiert, daft die durchschnittliche Lange der genannten 
Anzahl ununterbrochener Aneinanderreihungen von T oder '0' innerhalb eines Segments bestimmtwird, 
urn den Grad der Komprimierbarkeit des genannten einen Segments bestimmen zu konnen. 

25 

3. Verfahren gemaB einem der Anspruche 1 0der 2, dadurch charakterisiert, daft zu Beginn jedes Segments 
zwei Register zur Speicherung einer ersten und einer zweiten Zahl (m, n) gel oscht werden, daft alle Lan- 
gen (w) ununterbrochener Aneinanderreihungen von '0/ oder T bestimmt werden, daS, wenn eine Lange 
(w) gro&er ist als ein vorbestimmter LSngenwert (wO, dieser ignoriert wind; ansonsten die erste Zahl (n) 

^ v urn 1 , die zweite Zahl (m) urn w inkrementiert wird, und daB am Ende des Segments die Durchfuhrbarkeit 
der Komprimierung bestimmt wird, indem gepruft wird, ob n geringer ist als ein vorbestimmter erster 
Schwellenwert (rn) oder ob das Verhaltnis m/n einen vorbestimmten zweiten Schwellenwert ([m/nJt) uber- 
schreitet; in diesen Fallen wird eine Verlustkomprimierung durchgef unit, wahrend in alien anderen Fallen 
eine verlustfreie Komprimierung durchgef uhrt wird. 

35 

4. Verfahren gemaB Anspruch 3, dadurch charakterisiert, dad der genannte vorbestimmte Langenwert (wj 
12 ist, da& der genannte erste Schwellenwert (nj 5 ist, und daB der genannte zweite Schwellenwert ([m/ny 
4 ist. 

5. Verfahren gemafc einem der Anspruche 1 oder 2, dadurch charakterisiert, daft fur jedes Segment die An- 
40 zahl (m) an Nicht-Null-Bytes, die mindestens je eine "1" enthalten, gezahlt wird und jedes dieser Bytes in 

eine Binarzahl umgewandelt wird, die - gteich 1 ist, wenn das in Betracht befindliche Byte ein kleines 
schwarzes oder wei&es Intervall enthalt, das hei&t, dieses Byte ist nicht gleich 00000000 oder 11111111 
und das erste Bit des Bytes stimmt mit dem letzten uberein, und die • in alien anderen Fallen gleich Null 
ist, das hei&t, dak die Anzahl (n) der Paare aufeinanderfolgender "1" gezahlt wird, und dad eine Verlust- 
45 komprimierung durchgefuhrt wird, wenn n kleiner ist als ein vorbestimmter Schwellenwert und/oder wenn 

n kleiner ist als ein vorbestimmter Anteil von m. 

6. Verfahren gemafc Anspruch 5, dadurch charakterisiert, daft der Schwellenwert 3 ist und daB der Anteil 
5% betragt. 

50 

7. Verfahren gemaB jedem der vorherigen Anspruche, dadurch charakterisiert, daft vor dem genannten 
Schritt zur Bestimmung des Grads der Komprimierbarkeit der einzelnen Eigenschaften des ursprungli- 
chen Bildes ein Vorf ilterungsschritt stattf indet, der vor der Komprimierung die Filterung der Abtastdaten 
einbezieht, bevor die Komprimierung durch erzwungene Anderung aller isolierten "1"-Pixel auf "0" statt- 

55 f indet, wenn die genannten Sequenzen kurzer sind als eine vorbestimmte Lange. 

8. Verfahren gemaB Anspruch 7, dadurch charakterisiert, da& die genannte vorbestimmte Lange auf drei 
Pixel gesetzt wird. 
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9. Verfahren gemaB jedem der vorherigen Anspruche, dadurch charakterlsiert, dad die Segmente durch Par- 
titionierung des gesamten Bildbereichs in gegenseitig ausschlieBliche Segmente einheitlicher GroBe de- 
finiertwerden. 

10. Verfahren gemaB Anspruch 9, dadurch charakterisiert, daB die gegenseitig ausschlieBlichen Segmente 
je einer Zeile oderzwei aufeinanderfolgenden Zeiten der Abtastung des ursprunglich binaren Textbildes 
entsprechen. 

11. Verfahren gem§B jedem der vorherigen Anspruche, dadurch charakterisiert, daB die Verlustkomprimie- 
rung zwei Schritte zur Definition von 2x 2 Pixelbldcken umfaBt, und daB durch Zuordnung eines "0"-Werts 
an alle Bidcke mtt weniger als zwei schwarzen Pixel n fur jeden genannten Block eine 1-Bit-Darstellung 
entwickelt wird, und durch Zuordnung eines "1 "-Wert s an alle Blocke mit mehr als zwei schwarzen Pixeln , 
und durch Zuordnung eines "O'-Werts an alle Blocke mit genau zwei schwarzen Pixeln, vorausgesetzt, 
daB eine solche Zuordnung nicht die Verbindung zwischen zwei Linien aufbricht und nicht die Krummung 
einer Linienkurve ablenkt 



Revendications 

20 1. Procede destine a la compression de haute quaiite d' images de texte binaires pour un stockage et un trai- 
tement futur possible y compris la transmission sur des liaisons de donnees, impliquant le balayage de 
celles-ci sur le support d'enregistrement sur lequel elies sont presentees survant un balayage de trame 
pixel par pixel et en obtenant une chatne de donnees numeriques a partirde la sortie du dispositif de ba- 
layage, et en partageant ladite image en segments, caracterise par les etapes consistant a determiner le 

25 degre de com press ibilrte d'un segment en prenant en consideration le nombre de suites ininterrompues 

de "1* ou de "0" a I'interieur dudit segment et en affectant conformement audit degre de compressibilrte 
Tun parmi au moins deux taux de compression differents, Tun de ceux-ci entraTnant des pertes, aux don- 
nees dudit segment et en compressant lesdites donnees conformement avec tedtt un taux de compres- 
sion. 



30 
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2. Procede seion la revendication 1 , caracterise en ce que ta longueur moyenne dudit nombre de suites inin- 
terrompues de "1" ou de "0" dans un segment et determine de facon a determiner le degre de compres- 
sibility dudit un segment 

3. Procede seion Tune quelconque des revendications 1 ou 2, caracterise en ce que au debut de chaque 
segment, deux registres destines a memoriser un premier et un second nombre (m, n) sont effaces, en 
ce que toutes tes longueurs (w) de suites ininterrompues de "0" ou de "1" sont determin6es, en ce qu'une 
longueur (w) est plus grande qu'une valeur de longueur predetermine (wj, celle-ci est ecartee sinon, le 
premier nombre (n) est increments de 1 et le second nombre (m) de w, et en ce qu'a la fin du segment la 
faisabilite de la compression est determinee en verif iant si n est inferieur a un premier seuil (nj prede- 
termine, ou si le rapport m/n depasse un second seuil predetermine ([m/nJt), auxquels cas une compres- 
sion a pertes est effectuee alors que dans tous les autres cas une compression sans pertes est effectuee. 

4. Procede seion la revendication 3, caracterise en ce que ladite valeur de longueur predeterminee (wj est 
12, en ce que ledit premier seuil (nj est 5, et en ce que (edit second seuil ([m/nJt) est 4. 



5. Procede seion Tune quelconque des revendications 1 ou 2, caracterise en ce que pour chaque segment 
le nombre (m) d'octets non nuls, contenant au moins un "1" respectlvement, est compte et que chaque 
tel octet est transpose en un nombre binaire qui - est egal a 1 si I'octet que nous considerons contient un 
petit intervalle noir ou blanc, c'est-a-dire que cet octet n'est pas egal a soit 00000000 ou 1 1 1 1 1 1 1 1 et que 

so le premier bit de I'octet est egal au dernier, et qui - est egal a 0 dans tous les autres cas, en ce que le 

nombre (n) de paires de "1 " consecutifs est compte, et en ce qu'une compression a pertes est effectuee 
si n est plus petit qu'un seuil predetermine et/ou si n est plus petit qu'une fraction predeterminee de m. 

6. Procede conforme a la revendication 5, caracterise en ce que le seuil est 3 et en ce que la fraction est 
55 6gale a 5 %. 

7. Procede seion Tune quelconque des revendications precedentes, caracterise en ce que ladite etape de 
determination des degres de compress ibi lite des particularity individuelles de I'image originale est pre- 
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cedee d'une etape de pre-f iltrage appliquant le f iltrage des donnees balayees avant ta compression au 
moyen d'une modification forcee de tous les pixels "1" soles, de meme que toutes les sequences hori- 
zontales et verticales de pixels "1" a "0" si lesdites sequences sont plus courtes qu'une longueur prede- 
terminee. 

5 

8. Precede selon la revendication 7, caracterise en ce que ladite longueur predeterminee est etablie a trois 
pixels. 

9. Precede selon Tune quelconque des revendications precedentes caracterise en ce les segments sont de- 
10 finis en partageant la zone d'image entiere en segments mutuetlement exclusifs de taille uniforme. 

1 0. Procede selon la revendication 9, caracterise en ce que les segments mutuellement exclusifs correspon- 
dent chacun a une ligne ou a deux lignes consecutives de balayage de I'image de texte binaire original. 

11. Procede selon Tune quelconque des revendications precedentes, caracterise en ce que la compression 
15 a pertes implique les etapes consistant a def inir des blocs 2 x 2 de pixels et a developper pour chaque 

tel bloc une representation a 1 bit en appliquant une valeur "0" a tous les blocs presentant moins de deux 
pixels noirs, et une valeur "1° a tous les blocs presentant plus de deux pixels noirs, et en affectant une 
valeur °0" a tous les blocs presentant exactement deux pixels noirs, a condition qu'une telle affectation 
ne coupe pas une liaison entre deux lignes et ne deforme une courbure de ligne. 

20 
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FIG. 1 
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FIG. 5 
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